Deriving Ground Truth Labels for Regression Problems Using Annotator Precision
نویسندگان
چکیده
When training machine learning models with practical applications, a quality ground truth dataset is critical. Unlike in classification problems, there currently no effective method for determining single value or landmark from set of annotations regression problems. We propose novel deriving labels problems that considers the performance and precision individual annotators when identifying each label separately. In contrast to commonly accepted computing global mean, our does not assume annotator be equally capable completing specified task, but rather ensures higher-performing have greater contribution final result. The selection described within this paper provides means improving input data model development by removing lower-quality labels. study, we objectively demonstrate improved applying simulated where canonical position can known, as well sample collected crowd-sourced
منابع مشابه
Exploiting Intra-Annotator Rating Consistency Through Copeland's Method for Estimation of Ground Truth Labels in Couples' Therapy
Behavioral and mental health research and its clinical applications widely rely on quantifying human behavioral expressions. This often requires human-derived behavioral annotations, which tend to be noisy, especially when the psychological objects of interest are latent and subjective in nature. This paper focuses on exploiting multiple human annotations toward improving reliability of the ens...
متن کاملPrecision and Recall Without Ground Truth
In this paper we present a way to use precision and recall measures in total absence of ground truth. 1 Precision and Recall 1.1 General Definitions and Notation Precision Pr and Recall Rc (and often associated F-measure or ROC curves) are standard metrics expressing the quality of Information Retrieval methods [8]. They are usually expressed with respect to a query q (or averaged over a series...
متن کاملDeriving Labels
This paper builds on Hornstein’s (2005) proposal that Merge (Chomsky, 1995) can be split up into the more basic operations Concatenate and Label. This opens up the possibility that Concatenate applies without Label to generate flat structures, an option which Hornstein & Nunes (2005) explore for encoding adjuncts. The central claim is that Label is not needed as a syntactic primitive if a long ...
متن کاملUsing objective ground-truth labels created by multiple annotators for improved video classification: A comparative study
We address the problem of predicting category labels for unlabeled videos in a large video dataset by using a ground-truth set of objectively labeled videos that we have created. Large video databases like YouTube require that a user uploading a new video assign to it a category label from a prescribed set of labels. Such category labeling is likely to be corrupted by the subjective biases of t...
متن کاملA Ground Truth Inference Model for Ordinal Crowd-Sourced Labels Using Hard Assignment Expectation Maximization
In this paper we propose an iterative approach for inferring a ground truth value of an item from judgments collected form online workers. The method is specifically designed for cases in which the collected labels are ordinal. Our algorithm works by iteratively solving a hard-assignment EM model and later calculating one final expected value after the convergence of the EM procedure.
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applied sciences
سال: 2023
ISSN: ['2076-3417']
DOI: https://doi.org/10.3390/app13169130